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Abstract. We present the first spatial clustering measurements of z ~ 1, 24 /an- selected, star forming galaxies in the Great 
Observatories Origins Deep Survey (GOODS). The sample under investigation includes 495 objects in GOODS-South and 811 
objects in GOODS-North selected down to flux densities of /24 > 20 yujy and zab < 23.5 mag, for which spectroscopic redshifts 
are available. The median redshift, IR luminosity and star formation rate (SFR) of the samples are z ~ 0.8, L IR ~ 4.4 x 10 10 
L Q , and SFR~ 7.6 M Q yr~', respectively. We measure the projected correlation function w(r p ) on scales of r p = 0.06 - 10 
Mpc, from which we derive a best fit comoving correlation length of ro = 4.0 ± 0.4 h~ [ Mpc and slope of y = 1.5 ± 0.1 for 
the whole f M > 20 yujy sample after combining the two fields. We find indications of a larger correlation length for objects of 
higher luminosity, with Luminous Infrared Galaxies (LIRGs, L IR > 10" L Q ) reaching r ~ 5.1 h~ l Mpc. This would imply that 
galaxies with larger SFRs are hosted in progressively more massive halos, reaching minimum halo masses of ~ 3x 10 12 M G for 
LIRGs. We compare our measurements with the predictions from semi-analytic models based on the Millennium simulation. 
The variance in the models is used to estimate the errors in our GOODS clustering measurements, which are dominated by 
cosmic variance. The measurements from the two GOODS fields are found to be consistent within the errors. On scales of the 
GOODS fields, the real sources appear more strongly clustered than objects in the Millennium-simulation based catalogs, if the 
selection function is applied consistently. This suggests that star formation at z ~ 0.5-1 is being hosted in more massive halos 
and denser environments than currently predicted by galaxy formation models. Mid-IR selected sources appear also to be more 
strongly clustered than optically selected ones at similar redshifts in deep surveys like the DEEP2 Galaxy Redshift Survey and 
the VIMOS-VLT Deep Survey (VVDS), although the significance of this result is < 3<r when accounting for cosmic variance. 
We find that LIRGs at z ~ 1 are consistent with being the direct descendants of Lyman Break Galaxies and UV-selected galaxies 
at z ~ 2-3, both in term of number densities and clustering properties, which would suggest long lasting star-formation activity 
in galaxies over cosmological timescales. The local descendants of z ~ 0.5-1 star forming galaxies are not luminous IR galaxies 
but are more likely to be normal, L < L, ellipticals and bright spirals. 
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1. Introduction 

In the general paradigm of large scale structure formation, the 
small primordial fluctuations in the matter density field pro- 
gressively grow through gravitational collapse, leading to the 
present-day complex network of clumps and filaments which 
is often referred to as the "Cosmic Web". Baryons are believed 
to cool within dark matter halos (DMHs) and form galaxies and 
cluster of galaxies, whose distribution on the sky should then 
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trace that of the underlying dark matter. While the formation 
and the evolution of dark matter structures can be followed in 
a relatively straightforward way through N-body simulations 
(e.g., Jenkins et al. 1998; Springel et al. 2005), which can be 
also approximated analytically with high accuracy (Peacock & 
Dodds 1996), the physics of baryon cooling and galaxy forma- 
tion within DMHs is far more complex. As a result of these 
complex physical processes, the distribution of galaxies in the 
sky may be biased with respect to that of the underlying matter 
distribution. The amplitude of this bias is expected to evolve 
with cosmic time and be dependent on galaxy type, luminos- 
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ity and local environment (Norberg et al. 2002). The compari- 
son between the clustering properties of galaxies and those of 
DMHs predicted by cold dark matter (CDM) models can be 
used to evaluate the typical mass of the DMHs in which galax- 
ies form and reside as a function of cosmic time. Following the 
evolution of the typical DMH hosting a given galaxy type at 
any given time also allows one to predict the environment in 
which that galaxy should be found nowadays and the environ- 
ment in which it was residing in the past. In other words, under 
reasonable assumptions, it is possible to guess the progenitors 
and descendants of galaxy populations observed at any cosmo- 
logical epoch. 

Galaxy clustering has been traditionally studied by means 
of the two-point correlation function £(r), defined as the excess 
probability over random of finding a pair of galaxies at a sep- 
aration r from one another, which is often approximated with 
a power law of the form £(r) = (r/ro)~ y . In the local Universe 
different clustering properties have been observed as a function 
of galaxy type. In the Sloan Digital Sky Survey (SDSS, York 
et al. 2000), at a median redshift of z ~ 0.1, red, early-type 
galaxies show a larger correlation length and a steeper slope 
(r = 6.8 /r'Mpc, y = 1.9) than blue, late type galaxies (r = 
4.0/T 1 Mpc, y — 1.4; Zehavi et al. 2002). Similar results arise 
from the 2dF Galaxy Redshift Survey (2dFGRS; Colless et al. 
2001), in which, at a similar median redshift, passive galaxies 
show a correlation length and slope of ro = 6.0 /z~'Mpc,y = 1 .9 
as opposed to ro =4.1 /T 1 Mpc,y =1.5 measured for star form- 
ing galaxies (Madgwick et al. 2003). 

At cosmologically significant distances, deep surveys on 
sky patches of less than 1 deg 2 , complemented by large spec- 
troscopic campaigns, are measuring the clustering of high red- 
shift objects with reasonable accuracy. The separation between 
the clustering properties of star forming and passively evolving 
galaxies seems to be well established even at redshifts around 
1. In the DEEP2 Galaxy Redshift Survey, Coil et al. (2004) 
found that z ~ 0.9 galaxies with absorption line spectra have a 
correlation length significantly larger than emission-line galax- 
ies at the same redshift. A similar result has been found in the 
VIMOS-VLT Deep Survey (VVDS) by Meneux et al. (2006), 
who measured a correlation length that was larger for red galax- 
ies than for blue galaxies at z ~ 0.8. 

Porciani & Giavalisco (2002) and Adelberger et al. (2005) 
measured the clustering properties of star forming galaxies se- 
lected by the Lyman-break technique between redshifts of 1 .7 
and 3 (see also Hamana et al. 2004, Ouchi et al. 2005 and Lee 
et al. 2006 for Lyman break galaxies selected at z ~ 4 - 5). The 
measured comoving correlation length of 4.0-4.5 h~ x Mpc for 
these high redshift objects is expected to increase with time and 
suggests that they will evolve into moderate-luminosity, ellip- 
tical galaxies by z = (Adelberger et al. 2005). 

While all of the above described samples are based on opti- 
cal selection, star formation in galaxies can be efficiently traced 
by mid-infrared observations. The star formation rate (SFR), 
particularly the dust obscured component, is indeed directly 
correlated to the mid-IR luminosity, which is in turn a robust 
proxy for the total (8-1000/im) IR luminosity (e.g., Spinoglio 
et al. 1995; Chary & Elbaz 2001 ; Forster-Schreiber et al. 2004). 
This has been demonstrated in the present-day Universe, but 



seems to hold at least up to z ~ 1, where the bulk of star- 
formation occurs in dust-obscured regions. Indeed, the deep- 
est existing radio data have shown that Lir values determined 
from the mid-IR luminosity of galaxies at z ~ 1 are consistent 
with those derived using the radio to IR luminosity correlation 
(Elbaz et al. 2002, Appleton et al. 2004). 

Early work by the Infrared Astronomical Satellite (IRAS) 
showed that the correlation length of nearby (median z ~ 0.03) 
mid-IR bright galaxies (/eo/jm > 1-2 Jy) is about 4 hr x Mpc 
(Fisher et al. 1994), in agreement with the values measured for 
local star forming objects by the SDSS and 2dFGRS. More 
recently, an attempt to measure the clustering properties of 
mid-IR selected sources at fainter flux densities has been made 
(D'Elia et al. 2005). Based on a small sample of galaxies de- 
tected by the Infrared Space Observatory (ISO) with f\*, m > 
0.5 mJy, D'Elia et al. found that the clustering level measured 
for these z ~ 0.2 galaxies is similar to that measured by IRAS 
for more local sources. 

The Spitzer Space Telescope (Werner et al. 2004), with its 
unprecedented sensitivity at mid-IR and far-IR wavelengths, is 
enabling further progress to be made. Deep surveys at 24yum 
are being carried out in different regions of the sky (see, e.g., 
Papovich et al. 2004), with the deepest ones being performed 
in the two GOODS fields down to / 24 ~ 10 - 20piy (Chary 
et al. in preparation). For the first time, this allows us to select 
field galaxies based on their ongoing level of star formation 
activity at a wavelength where dust corrections will be negli- 
gible. This is a more physically motivated selection than those 
based on qualitative galaxy properties like color bi-modality. 
It thus provides greater insight into the nature of galaxy and 
star formation in the distant Universe and a more straightfor- 
ward comparison to galaxy formation models. Our goal is to 
investigate the spatial distribution of z < 1 star forming galax- 
ies, and assess the dependence between environment and star- 
formation rate. By constraining the nature of the descendants 
of star forming galaxies at z < 1, we provide insight into the 
nature of downsizing of galaxy formation, a well established 
pattern for galaxy evolution which implies that star formation 
is taking place preferentially in more massive galaxies at higher 
redshifts (e.g., Cowie et al. 1996). A tight correlation between 
galaxy mass and star-formation rate has been discovered, with 
slope close to unity. This correlation has been shown to exist 
both in the local Universe as well as at z ~ 1.2 (Noeske et 
al. 2007; Elbaz et al. 2007) with tentative evidence that it may 
be valid even at z ~ 2 (Daddi et al. 2007a). As more massive 
galaxies are on average hosted in more massive halos, we ex- 
pect to find a direct correlation between clustering strength and 
star formation rate in the distant Universe. 

Given the large (5-6 arcsec FWHM) resolution of the MIPS 
instrument (Multiband Imaging Photometer for Spitzer; Rieke 
et al. 2004) confusion problems due to blending are severe 
at the faintest flux densities. This makes a proper measure of 
the angular correlation function of faint MIPS sources difficult, 
leaving the 3D correlation function as the most viable method 
for estimating their clustering properties. In this paper, we mea- 
sure the spatial clustering of 24/im selected sources in the two 
GOODS fields by means of the projected correlation function 
w(r p ). Blending problems at short scales are completely over- 
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Fig. 1. Spectroscopic completeness down to zab = 23.5 mag 
for galaxies with f%4 > 20 pJy in GOODS-N (upper curve) and 
GOODS-S (lower curve). 

come in this case, as angular clustering terms are negligible as 
discussed later in the paper. 

The paper is organized as follows: in Section 2 we describe 
the data sets and the selection criteria adopted to define the 
samples used in the clustering analysis. In Section 3 we present 
the methods utilized to estimate the correlation function. In 
Section 4 several safety checks are performed to validate the 
adopted techniques. Simulations are also run to estimate errors 
on our measurements due to cosmic variance. The results of our 
analysis are presented in Section 5. In Section 6 the clustering 
measurements are discussed, interpreted and compared to esti- 
mates from optical surveys. The conclusions are presented in 
Section 7. 

Throughout this paper, a flat cosmology with O jn = 0.3 
and Qa = 0.7 is assumed. Unless otherwise stated, we refer to 
comoving distances in units of h~ x Mpc, where Hq = 100 h km 
s _1 Mpc -1 . Luminosities are calculated using h — 0.7. 

2. The samples 

The GOODS-South and GOODS-North fields, each covering 
about 10x16 arcmin, have been observed by Spitzer as part of 
the Great Observatories Origins Deep Survey Legacy Program 
(Dickinson et al. 2007, in preparation). Deep 24 pm obser- 
vations with MIPS were carried out down to sensitivities of 
~ 12yuJy (~ 3 <x) in both fields (Chary et al., in prepara- 
tion). Source catalogs at shorter wavelengths (Dickinson et al., 
in preparation) based on the Infrared Array Camera (IRAC; 
Fazio et al. 2004), were used as prior positions in order to 
improve source deblending and identify unique counterparts. 
Spectroscopic redshifts have been collected for about 60% of 
the MIPS sources with zab < 23.5 mag from a compilation of 
all the different follow-up spectroscopy programs carried out 
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Fig. 2. 24pm flux density vs redshift for sources detected 
by Spitzer/MTPS in GOODS-S (open circles) and GOODS-N 
(filled circles). Only sources with spectroscopic redshifts are 
shown. The dashed line shows the /24 = 20 piy flux limit used 
in this work. 



in the GOODS fields. In particular, for the GOODS-S field, we 
use the spectroscopic redshifts made available by Le Fevre et 
al. (2004), Mignoli et al. (2005), Vanzella et al. (2005; 2006). 
Redshifts in GOODS-N have been published in many papers 
over the years. At the redshifts of interest in this paper, the 
largest portion of the published redshifts can be found in Cohen 
et al. (2000), Wirth et al. (2004), and Cowie et al. (2004). We 
supplement these with additional redshifts for 24pm selected 
sources from Stern et al. (in preparation). The spectroscopic 
completeness down to zab = 23.5 mag is shown in Fig. Q] 
In both fields the completeness level decreases towards fainter 
magnitudes, but in GOODS-N it is systematically higher than 
than in GOODS-S. For sources with zab < 23.5 mag the com- 
pleteness level in GOODS-N is 65%, compared to 50% in 
GOODS-S. Only sources at 0.1 < z < 1.4 were considered 
in this work. The z < 1 .4 limit is imposed in order to remain in 
a redshift range where the spectroscopic sampling is highest, 
and where the observed 24pm flux density can be used as an 
accurate tracer of the total IR luminosity of galaxies. Although 
24pm observations can be used to obtain reasonable measure- 
ments of star formation activity averaged over the galaxy popu- 
lation at even higher redshifts (e.g., Daddi et al 2005), individ- 
ual sources with anomalous properties may show significant 
errors in their derived L/« (Daddi et al 2007a, Papovich et al. 
2007). Redshift quality flag information is available for most 
of the spectroscopic surveys done in GOODS-S, but is miss- 
ing for some of the surveys in GOODS-N. In GOODS-S we 
considered only objects with high quality flags. In GOODS-N 
we have excluded some galaxies (< 1% of the total sample) 
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Fig. 3. 24jt/m flux density vs zab magnitude for sources detected 
by Spitzer/NttPS in GOODS-S (open circles) and GOODS-N 
(filled circles). Only sources with spectroscopic redshifts are 
shown. The dashed line shows the fa = 20 yuJy flux limit used 
in this work. 



which appear to have incorrect spectroscopic redshifts, based 
on the shape of their spectral energy distribution and photo- 
metric redshifts. Furthermore, we have limited our analysis to 
sources with fa > 20^Jy, for which the flux density estimate is 
reliable and source confusion is well understood (Chary 2006). 
About 20% of the sources fall below this limit and are there- 
fore excluded from our clustering analysis. In total, 558 objects 
in GOODS-S and 875 objects in GOODS-N are found to sat- 
isfy these selection criteria (including AGN, see later). After 
accounting for spectroscopic incompleteness, the number of 
fa > 20/Jy sources in GOODS-S and GOODS-N differ by 
~ 20%. As shown in Section 1531 this is consistent with being 
due to cosmic variance. 

In Fig.|2]and Fig.[3]the 24-fim flux densities of sources in the 
two GOODS fields are plotted against their spectroscopic red- 
shifts and zab magnitudes, respectively. Fainter 24-fim sources 
have on average fainter optical counterparts and tend to be at 
higher redshifts, although the redshift dependence of the av- 
erage 24/vm flux density appears rather weak. Several redshift 
structures can be immediately identified, which are also traced 
by sources selected at other wavelengths (e.g., Cohen et al. 
1996; Gilli et al. 2003; Barger et al. 2003). The 24jt/m flux 
density and redshift distribution in the two fields are similar 
(see also Fig. |5J. The median 24ytzm flux density, optical mag- 
nitude and redshift for the considered samples are fa ~ 74 fiiy, 
Zab ~21 .8 mag and z ~ 0.75, respectively. We compute the total 
(8-1000/im) IR luminosity Lj R from the observed 24/mi flux 
density, assuming the luminosity-dependent model templates 
of Chary & Elbaz (2001). The total IR luminosity provides a 



Fig. 4. L/r vs redshift for sources detected by Spitzer/MlPS in 
GOODS-S (open circles) and GOODS-N (filled circles). Only 
sources with spectroscopic redshifts are shown. The dashed 
line shows the fa = 20 yuJy flux limit used in this work. 

measure of the star formation rate in the galaxy using the rela- 
tion SFR= L IR x 1.72 x 1O-'°M yr _1 (Kennicuttet al. 1998). 
We note that if more recent estimates of the stellar initial mass 
function are adopted (Kroupa 2001, Chabrier 2003), the same 
Ljr systematically converts into a ~ 30% lower SFR. The ex- 
act conversion rate does not have an important effect on our 
results. The L/« (SFR) versus redshift plot for the galaxy sam- 
ple considered here is shown in Fig. [4] along with the Lj R cut 
introduced at each redshift by the fa > 20yuJy selection. The 
luminosity distribution is similar in the two fields. The median 
luminosity and star formation rate are 4.4 x 10 10 L Q and 7.6 M 
yr , respectively. About 90% of the objects in the two fields 
have L IR > 10 10 L Q while about 30% have L, R > 10 11 L . The 
latter are classified as Luminous Infrared Galaxies (LIRGs), 
and are forming stars at an average rate of ~35 M yr -1 . 

We note that the SFR estimated from the L/« values 
may be a lower limit to the true galaxy SFR since it ex- 
cludes the unobscured star-formation traced by the observed 
UV emission. We therefore considered B-band data from the 
Advanced Camera for Surveys (ACS) onboard the Hubble 
Space Telescope (HST), which traces the rest frame UV flux 
for galaxies at z > 0.5, i.e., for the majority of the sources in 
our sample. We found that the SFR increases by only 4% on 
average when including the ACS data. We also note that the 
fraction of galaxies for which the SFR may have been underes- 
timated significantly (e.g., by a factor of 1.5-2), is less that 4%. 
Due to the fact that the UV flux may have a contribution from 
old, evolved stars, these correction factors are upper limits. Our 
estimates appear to be in good agreement with those of Bell et 
al. (2005), who derive an average UV contribution of 5-10% to 
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the global (mid-IR + UV) SFR of z ~ 0.7 star forming galax- 
ies observed by Spitzer. Furthermore, since the UV correction 
decreases with increasing SFR, it becomes completely negligi- 
ble for LIRGs. To summarize, UV corrections to the SFR do 
not have a significant impact on our results and are therefore 
neglected in the following analysis. 

While most of these mid-IR selected sources are expected 
to be star forming galaxies (elliptical galaxies should be virtu- 
ally absent from mid-IR selected samples), a significant frac- 
tion of sources may be active galactic nuclei (AGN), in which 
the radiation absorbed by circumnuclear material is re-emitted 
in the IR regime. Based on the X-ray properties of sources, we 
therefore tried to eliminate AGN interlopers. Both fields have 
been observed by Chandra with extremely deep (1-2 Msec) ex- 
posures (Giacconi et al. 2002, Alexander et al. 2003). Using 
an AGN classification similar to that adopted in Gilli et al. 
(2005), we flagged as AGN those sources with either observed 
0.5-10 keV luminosities above 10 42 erg s _1 or with a column 
density above Nh = 10 22 crrT 2 . The column density was es- 
timated by assuming an intrinsic AGN template with spectral 
index of 0.7 and absorbing it at the source redshift to repro- 
duce the observed hard-to-soft X-ray flux ratio. About 8% of 
the sources were removed from the samples using this AGN 
classification. We nonetheless verified that, due to the small 
fraction of AGN candidates, our results are insensitive to the 
methodology adopted to remove AGN. Moreover, our conclu- 
sions do not vary significant even if AGN are not excluded from 
the sample. 

After the AGN are removed, we are left with samples 
of 495 and 811 galaxies, in GOODS-South and North, re- 
spectively. One may wonder if our samples are significantly 
contaminated by AGN which went undetected in the X-rays. 
Indeed, Alonso-Herrero et al. (2006) in GOODS-S and Donley 
et al. (2007) in GOODS-N, respectively, have identified a large 
population of IR luminous galaxies showing power-law emis- 
sion in the IRAC 3.6 - 8/im bands. The power-law emission 
is thought to be due to hot dust in the vicinity of the AGN. 
Yet, half of these sources do not have an X-ray counterpart. 
We verified that none of these power-law AGN are present in 
our samples. We note that the Donley et al. (2007) and Alonso- 
Herrero et al. (2007) samples are based on shallow 24/im data, 
span a broader redshift range and primarily include objects with 
photometric redshifts. In contrast, our galaxies sample much 
fainter 24 fim flux densities and have spectroscopic redshifts of 
z < 1 .4. We are in the process of defining IR-based AGN sam- 
ples in our deep MIPS catalogs. Preliminary analysis suggests 
that < 10% of sources might be flagged as additional AGN 
candidates and in principle should be removed from our sam- 
ples. Their impact on the clustering measurements presented 
in this work is unlikely to be significant and will be discussed 
elsewhere when the AGN catalogs are finalized. Very recently, 
Daddi et al. (2007b) have shown that a population of highly 
obscured AGN, which are both undetected in the X-rays and 
do not show a power-law continuum in the IRAC bands, hide 
in about 20-30% of IR luminous (L IR > 10 12 L ) galaxies at 
z ~ 2, providing a significant contribution to their 24/im emis- 
sion (see also Fiore et al. 2007). Given the relatively low IR lu- 
minosities (Lir ~ 10 10_11 L©) and the longer mid-IR rest-frame 



wavelengths probed here at z ~ 0.7, we expect that the effect of 
contamination from an obscured AGN population will be less 
important for our study. 

It should also be noted that we are measuring the cluster- 
ing properties of mid-IR selected galaxies over a broad redshift 
range from z = 0.1 to z = 1.4. Star forming galaxies are under- 
going rapid cosmological evolution in luminosity/density over 
this redshift range (e.g., Le Floc'h et al. 2005), and the clus- 
tering strength is also likely to evolve. Although most of the 
clustering signal measured in this work is due to galaxy pairs 
at z ~ 0.7, our measurements could be returning a value for the 
clustering strength that is an average between 0< z < 1 .4. Thus, 
our analysis is not identical to that obtained by considering an 
ideally large galaxy sample in a narrow redshift interval around 
z ~ 0.7. This caveat should be borne in mind when comparing 
our results with those obtained from other surveys. 

3. Analysis techniques 

To eliminate the distortions introduced by peculiar velocities 
and redshift errors, which affect the computation of the source 
clustering in redshift space, we resort to the projected correla- 
tion function, defined as in Davis & Peebles (1983): 



w(r p ) 



g(r p , r v )dr v , 



(1) 



where %{r p , r v ) is the two-point correlation function expressed 
in terms of the separations perpendicular (r p ) and parallel (r„) 
to the line of sight, in comoving coordinates. 

If the real space correlation function can be approximated 
by a power law of the form £(r) = (r/ro)~ r and r v o = oo then 
the following relation holds (Peebles 1980): 



w(r p ) = A(y)rlr), y , 



(2) 



where A(y) = F(l/2)F[(y - l)/2]/r(y/2) and T(x) is Euler's 
Gamma function. A(y) increases from 3.68 when y = 1.8 to 
7.96 when y = 1.3. The integration limit r v o is fixed to 10 h' 1 
Mpc to maximize the correlation signal (see the end of this 
Section). 

To estimate the correlation function £(r p ,r v ) we used the 
Landy & Szalay (1993) estimator, which has been shown to 
have a nearly Poissonian variance and which appears to outper- 
form other popular estimators (e.g., see Kerscher et al. 2000): 



€(r p ,r v ) 



[DD] - 2[DR] + [RR] 
[RR] ' 



(3) 



[DD], [DR] and [RR] are the normalized data-data, data- 
random and random-random pairs, i.e., 



[DD] = DD(r p , r v ) 

[DR] = DR(r p ,r v ) 
[RR] = RR(r p ,r Y ), 



n r (n r - 1) 
nd(n d - 1) 

(«r - 1) 



2n d 



(4) 

(5) 
(6) 



while DD, DR and RR are the number of data-data, data- 
random and random-random pairs at separations r p ± Ar p and 
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Fig. 5. Redshift distribution for MIPS sources with / 24 > 20juJy and spectroscopic redshift in the GOODS-S (left) and GOODS- 
N (right) fields (AGN excluded), binned to Az = 0.01. The smooth curves show the selection function adopted to generate the 
random control sample, obtained by smoothing the observed redshift distributions and truncated at z < 0.1 and z > 1.4 as the 
data samples. 



r v ± Ar v ; and n r are the total number of sources in the data 
and random sample, respectively. 

In order to avoid confusion, we specify how galaxy pairs 
are counted. The number of DD and RR pairs have been 
counted only once, i.e., the total number of pairs in the real 
and in the random samples are nd(rid — l)/2 and n r (n r - I) 12, 
respectively. This accounts for the factor of 2 in the denomina- 
tor of Eq. 5. This way of counting DD and RR pairs has been 
adopted by e.g., Landy & Szalay (1993), Guzzo et al. (1997), 
Gilli et al. (2005), Meneux et al. (2006). Other authors, instead, 
count DD and RR pairs twice, i.e., the total numbers of pairs in 
their real and random samples are rid(rid - 1) and n r (n r - 1), 
respectively, which removes the above mentioned factor of 2 
from their formulae. These latter definitions have been adopted 
e.g., by Davis & Peebles (1983), Kerscher et al. (2000), Zehavi 
et al. (2002), Coil et al. (2004). It can be easily shown that the 
two formulations lead to the same £(r). A simple way to see this 
is to replace DD and RR in the formulae of Zehavi et al. (2002) 
and Coil et al. (2004) with 2DD' and 2RR', where DD and RR 
are the numbers of pairs counted twice, while DD' and RR' are 
the numbers of pairs counted once ("independent" pairs). 

We note that in Eqs. 4 and 5, iid is the number of sources 
observed in each GOODS field separately. Ideally, instead of 
using the observed number of sources, which may produce 
an overestimate (underestimate) of the clustering amplitude in 
under-dense (over-dense) regions, one should use the true mean 
source number, which is unknown. In principle, averaging the 
densities of the GOODS-N and GOODS-S fields would give 
a better approximation to the mean source density. However, 
because of the different spectroscopic completeness in the two 



GOODS fields, the estimate of the average density in a given 
redshift range may be non-trivial. One possibility is to assume 
that the total number of sources in the redshift range consid- 
ered in this work (z = 0.1 - 1.4) is 20% larger in GOODS-N 
than in GOODS-S. This would be comparable to the difference 
observed in the total surface density of MIPS sources (after ac- 
counting for the 65% and 50% total spectroscopic complete- 
ness of GOODS-N and GOODS-S, respectively). However, 
since the spectroscopic completeness is a function of redshift 
and optical magnitude, and the completeness curves are dif- 
ferent between the two fields (see Fig. 1), this may not be 
the case. At any rate, we have verified that, assuming that the 
z=0.1-1.4 source density is 20% larger in GOODS-N than in 
GOODS-S, the use of an averaged density value (i.e., increas- 
ing rid by 10% in GOODS-S and decreasing it by the same 
amount in GOODS-N) gives a ~ 10% shorter (longer) corre- 
lation length in GOODS-S (GOODS-N) than that estimated by 
using the density of each field separately. These fluctuations 
are of the same order as produced by redshift structures in our 
fields (see Section 5.1) and well within the cosmic variance 
errors (Section 4). Therefore, they do not change the main con- 
clusions of the paper. 

Since both the redshift and the coordinate (a, 6) distribu- 
tions of the selected MIPS sources are potentially affected by 
observational biases, special care has to be taken in creating 
the sample of random sources. We adopted a procedure that 
has been shown to work well for X-ray AGN selected in the 
same fields (see Gilli et al. 2005). The redshifts of the ran- 
dom sources were extracted from a smoothed distribution of 
the real one, which should then include the same observational 
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Fig. 6. Distribution of 24yum sources with > 20^/Jy and with spectroscopic redshifts over the GOODS fields. GOODS-S 
is shown in the left panel while GOODS-N is shown in the right panel. The GOODS-S and GOODS-N fields are centered at 
(RA 72000, DEC y 2 ooo) = (53.122368, -27.797262) and (189.215744, 62.234791), respectively. Sources in the random samples have 
been placed at the coordinates of the real sources. 



biases. We assumed a Gaussian smoothing length cr z — 0.2 as a 
good compromise between smoothing scales that are too small 
(which suffer from significant fluctuations due to the observed 
redshift spikes) and scales that are too large (where on the con- 
trary the source density of the smoothed distribution at a given 
redshift might not be a good estimate of the average observed 
value). For each of the source subsamples considered in this 
work (see Table 1), we smoothed the corresponding observed 
redshift distribution. The observed and smoothed redshift dis- 
tributions for the fn > 20yuJy samples are shown in Fig. [5] 
Due to the numerous redshift spikes observed, we did not try 
to measure the correlation function in different redshift bins 
since this would be extremely sensitive to the choice of bin 
boundaries. The coordinates (a, 6) of the random sources were 
extracted from the coordinate ensemble of the real sample in 
order to reproduce the same uneven distribution on the plane 
of the sky. This procedure will in principle, reduce the corre- 
lation signal, since it removes the effects of angular clustering. 
However, as will be verified later, in deep, pencil-beam surveys 
like GOODS, where the radial coordinate spans a much broader 
distance than the transverse coordinate, most of the signal is 
due to redshift clustering, while angular clustering contributes 
at most a few percent. The distribution on the sky of the real 
sample is shown in Fig. [6] Each random sample is built to con- 
tain more than 10000 sources. 

The source pairs were binned in intervals of Alogr ; ,=0.1, 
and w(r p ) was measured in each bin. The resulting data points 
were then fit with a power law and the best fit parameters y and 
ro were determined via^f 2 minimization. Given the small num- 
ber of pairs which fall into certain bins (especially at the small- 



est scales), we used the formulae of Gehrels (1986) to estimate 
the 68% confidence interval (i.e., l<x errorbars in Gaussian 
statistics). 

It is well known that Poisson error bars underestimate the 
uncertainties in the correlation function when source pairs are 
not independent, which is the case for our sample. More impor- 
tantly, these uncertainties do not account for cosmic variance. 
In the next Section we assess the errors to be assigned to our 
best fit parameters by measuring w(r p ) on a series of simulated 
galaxy catalogs. 

A practical integration limit r,, has to be chosen in Eq. 1 
in order to maximize the correlation signal. Indeed, one should 
avoid r v o values which are too large since they would mainly 
add noise to the estimate of w(r p ). On the other hand, scales 
which are too small, comparable to the redshift uncertainties 
and to the pairwise velocity dispersion, should also be avoided 
since they would not allow recovery of the entire signal. To 
search for the best integration limit r v ,o, we measured w(r p ) and 
the corresponding best fit ro and y values for different r v o values 
ranging from 3 to 100 lr l Mpc. Since deviations from a simple 
power law are sometimes observed (in particular for r,,o = 20 - 
50 hr x Mpc in GOODS-N), using the best fit correlation length 
or clustering amplitude A = rZ as a measure of the clustering 
level is incorrect. To overcome this problem, we chose to quote 
the w(f p ) values on a representative scale, as a function of r V Q. 
We adopt r p — 1 hr x Mpc as our representative scale, since 
it is well within the considered r p range, and is a separation 
at which the projected correlation function, w(l hr l Mpc), is 
determined with good accuracy. 
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Fig. 7. Projected correlation function w(r p ) at r p = 1 h 1 Mpc 
measured in GOODS-S (open circles) and GOODS-N (filled 
circles) as a function of the integration limit r v o (see Eq. 1). 
Errorbars take into account cosmic variance (see Section 4). 
The turndown at very large scales in GOODS-S is likely due to 
sampling noise, in the regime where r v o is much larger than the 
size of the redshift peaks (Gilli et al. 2003). 



In Fig. [7] we plot w(l h Mpc) as a function of the radial 
integration limit r V Q. We note that the signal amplitude keeps 
increasing up to r v o ~ 10 — 20 Mpc. For r v o values greater 
than 10-20/z -1 Mpc, w(l h~ l Mpc) does not vary significantly. 
In the following, we therefore fix r v o to 10 h~ l Mpc. Such a 
value for the integration limit is consistent with what has been 
widely used in the literature (e.g., Carlberg et al. 2000). 



4. Safety checks and error estimate 

We have checked to see if our method for generating the ran- 
dom sample can bias in some way the best fit correlation pa- 
rameters that we measure. In particular, placing the random 
sources at the coordinates of the real sources completely re- 
moves the contribution of angular clustering to the total clus- 
tering signal, which could bias the measured correlation length 
to lower values. We quantify this effect by considering 428 
sources within a radius of 4.8 arcmin from the center of 
GOODS-N, where the spectroscopic coverage is most com- 
plete. We measured the correlation function in two ways: first, 
by placing the random sources at the coordinates of the real 
sources, and second, by placing the random sources truly at 
random within this area. When using this second method, ro 
increases by only 4%. Therefore, most of the clustering signal 
is provided by clustering along the radial direction, validating 
the adopted technique. 



Another confirmation that this technique is not produc- 
ing biased measures comes from tests performed on the mock 
galaxy catalogs based on the Millennium Simulation (Springel 
et al. 2005). These catalogs have been obtained by model- 
ing galaxy formation through semi-analytic recipes applied to 
the pure dark matter N-body simulations of the Millennium 
run. Physical processes like gas cooling, star formation, su- 
pernovae and AGN feedback are taken into account, which 
are described in detail in Croton et al. (2006) and De Lucia 
& Blaizot (2007). Here we considered the most recent work 
by Kitzbichler & White (2007), who built a number of simu- 
lated light cones for deep galaxy surveys over 2 deg 2 sky fields. 
Each cone contains about 6.5 x 10 5 objects, for which a num- 
ber of observable and physical properties like redshift, opti- 
cal and near-IR magnitude, and star formation rate are listed. 
We considered one of these mock catalogs and applied to the 
simulated galaxies the same selection criteria adopted to de- 
fine our data samples (see details in Section Here some 
assumptions have to be made, since neither the zab magni- 
tude, not the 24yum flux density are directly available for the 
simulated sources. We used Iab as a proxy for zab, assuming 
the I - z color expected for star forming galaxies at z ~ 0.8 
((I-z)ab = 0.24, Bruzual & Chariot 2003). Also, we converted 
the model star formation rate into IR luminosity using the re- 
lation SFR= L !R x 1.72 x 10~ 10 M yr~' and then, at each red- 
shift, considered only objects above the Ljr threshold plotted 
in Fig. |U which corresponds to the /24 > 20 yuJy threshold used 
to define our data samples. The final mock sample contains 
about 50000 objects, for which we computed the projected cor- 
relation function over the same r p range used for the GOODS 
data, first placing the random control sources at the positions 
of the Millennium sources and then placing the random control 
sources really at random within the 2 deg 2 field. No signifi- 
cant variations are observed between the projected correlation 
function computed in the two cases, suggesting again that the 
contribution of angular clustering is negligible. 

As shown in Table 1, when the same selection criteria 
are applied to the Millennium galaxies, these have on average 
different redshifts and luminosities than real mid-IR selected 
galaxies. We note however that our main goal is not to se- 
lect mock galaxies with average properties identical to the real 
ones, but investigate any difference (e.g., in the average Ljr or 
SFR) between the data and the galaxy formation models once 
real and mock galaxies have been selected in the same way. 
This issue will be addressed in Sections 5 and 6. 

The mock catalogs from the Millennium simulation have 
also been used to estimate the global errors on the best fit pa- 
rameters ro and y, and to evaluate cosmic variance on the scale 
of the GOODS fields. This has been achieved by extracting 
from one of the Millennium mock catalogs samples of galaxies 
with progressively redder R - 1 colors and in the same redshift 
range as the GOODS galaxies. The clustering strength of the 
mock samples increases with redder R - 1 color threshold. We 
then split the 1 .4 x 1.4 deg field over which each sample is dis- 
tributed into 40 independent rectangles with the dimensions of 
a GOODS field (i.e., 10 x 16 arcmin). For each color sample, 
we measured the projected correlation function in each rectan- 
gle and computed the rms of the ro and y distributions. After 
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Table 1. Summary of the best fit clustering parameters. Poissonian uncertainties (only) are quoted here to allow comparison 
between different galaxy samples within the same GOODS field (see text). When comparing the results between the two fields, 
or when comparing the average properties of GOODS sources with those of other fields, cosmic variance uncertainties must also 
be included (see Table 2). 



Sample 


N" 


z range 




^IR 


ro 


r 


r (7= 1-5) 








[h 


- 1 Mpc] 




[/r 1 Mpc] 




GOODS-South 


/ 24 > 20 y uJy 


495 


0.1-1.4 


0.74 


4.58 


4.25 ±0.12 


1.51 ±0.04 


4.23 ± 0.09 


L IR > 1O 1O L 


444 


0.1-1.4 


0.81 


5.51 


4.58 ±0.13 


1.53 ±0.04 


4.52 ±0.11 


Lm > 10"L© 


161 


0.1-1.4 


1.04 


20.6 


5.22 ±0.31 


1.61 ±0.09 


5.00 ±0.29 


Lm < 1O H L 


334 


0.1-1.4 


0.67 


2.62 


4.09 ±0.15 


1.54 ±0.05 


4.03 ±0.14 


Lm > 10"L o 


63 


0.5-1.0 


0.73 


17.2 


6.21 ±0.55 


1.56 ±0.14 


6.12 ±0.51 


10 10 < L lR < 10 n L o 


177 


0.5-1.0 


0.69 


2.83 


4.18 ±0.23 


1.50 ±0.07 


4.18 ±0.17 


GOODS-North 


/24 > 20^ 


811 


0.1-1.4 


0.76 


4.26 


3.81 ±0.08 


1.52 ±0.03 


3.77 ± 0.06 


L IR > 10 10 L o 


734 


0.1-1.4 


0.80 


4.86 


4.03 ± 0.09 


1.52 ±0.03 


3.99 ± 0.07 


Lm> 10 U L Q 


218 


0.1-1.4 


0.95 


20.1 


5.05 ±0.27 


1.55 ±0.07 


4.92 ± 0.21 


Lm < 10"L o 


593 


0.1-1.4 


0.59 


2.78 


3.52 ± 0.09 


1.54 ±0.04 


3.46 ± 0.08 


Lm > 10"L o 


111 


0.5-1.0 


0.85 


18.6 


4.66 ± 0.63 


1.42 ±0.13 


4.94 ± 0.40 


10 10 < L IR < 10"L© 


320 


0.5-1.0 


0.75 


3.31 


3.42 ±0.11 


1.67 ±0.06 


3.14 ±0.10 


Millennium'' 


/24 > 20 y uJy 


49043 


0.1-1.4 


0.83 


3.6 


2.82 


1.59 


2.52 


L IR > 1O 1O L 


44114 


0.1-1.4 


0.87 


4.1 


2.77 


1.58 


2.51 


Lm > lO n L 


6423 


0.1-1.4 


1.10 


13.2 


3.31 


1.64 


2.82 


Lm < 1O"L 


42620 


0.1-1.4 


0.78 


3.0 


2.75 


1.54 


2.63 



"Number of objects in each sample. 
*Median redshift. 

^Median IR luminosity in units of 10 10 L Q . 
^Statistical errors on r and y are below 0.01. 



subtracting in quadrature the (small) term due to Poissonian 
noise, we are left with the intrinsic cosmic variance. This proce- 
dure allows us to compute the appropriate variance for sources 
that are clustered similarly to the GOODS galaxies considered. 
We found that, on GOODS-sized fields, the fractional rms of 
the correlation length increases from 14% for sources with 
ro ~ 4 h~ x Mpc to 20% for sources with ro = 5.2 h~ x Mpc, 
i.e., for populations as clustered as our total and LIRGs sam- 
ples, respectively (see the next Sections). Using the fractional 
rms values found with this method, the global errors related to 
our samples can be easily estimated once the Poissonian term 
is added back in quadrature. When averaging the properties of 
the two GOODS fields and presenting the results for the com- 
bined GOODS-S plus GOODS-N sample (see, e.g., Table 2), 
the variance estimated from the simulations is divided by a fac- 
tor VI 

We note here that the error term due to cosmic variance 
should only be considered when comparing the clustering of 
the same population of sources across different fields, while it 
should be ignored when investigating clustering trends among 
different source sub-populations in the same field. Indeed, cos- 
mic variance should increase or decrease the overall cluster- 
ing amplitude over a given sky region, without modifying sig- 
nificantly the relative clustering between different galaxy sub- 
samples (e.g., sources with different L/r), provided that their 
redshift distributions are similar, i.e., sources in the different 
subsamples are tracing the same large scale structures. For this 



reason, in Table 1 we quote only Poissonian uncertainties, suit- 
able for comparison between different samples within the same 
field. When comparing the properties of the same population of 
sources between GOODS-N and GOODS-S, the cosmic vari- 
ance term should be included. When this is done, we find that 
that the clustering amplitudes measured in the two fields are 
fully compatible with each other (see next Section). In Table 2 
we quote the clustering parameters averaged between the two 
samples, with uncertainties that include cosmic variance. 

The Millennium mock catalogs, in which large source sam- 
ples can be selected to minimize statistical noise, were also 
used to check if limiting the integration radius r^o to 10 h~ x 
Mpc may introduce a systematic bias on our clustering mea- 
surements. We selected a population of mock galaxies with 
R - I > 0.65, which shows a clustering level similar to that 
of our MIPS sources (ro ~ 4 h Mpc), and measured w(r p ) 
as a function of the integration radius r V Q. We found that for 
r V Q = 30 h~ x Mpc the clustering signal already saturates, and 
we verified that for r v o = 10 hr x Mpc the ro value is bi- 
ased low by 5% with respect to the full, saturated value. In 
the Millennium catalogs, "purely cosmological" redshifts are 
also available which are free from peculiar velocities. We used 
these to compute the correlation function in redshift space £(r) 
for the same mock sample, which should provide an unbiased 
measurement of ro. The resulting ro is in very good agreement 
with that measured from w(r p ) for r^o > 30 hr x Mpc and there- 
fore confirms that when using r v o = 10 hr x Mpc, ro is biased 
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Fig. 8. Projected correlation function measured for the total 
/ 24 > 20//Jy MIPS samples in GOODS-S and GOODS-N (open 
and filled circles, respectively) compared with that obtained 
from the Millennium simulation on a 2 deg 2 field (filled tri- 
angles). Errorbars for the GOODS samples take into account 
cosmic variance (see Section 4). The best fit power laws are 
shown as dashed lines. 

low by 5%. We therefore conclude that the r measurements 
presented in this work could underestimate the real values by 
~ 5%. At any rate, we do not try to correct for this small sys- 
tematic bias since it is found to be well within the uncertainties 
due to cosmic variance. 

Finally, one may wonder if the fitting procedure to w(r p ) 
adopted in the previous Section, in which a simple Poisson 
weighting of the datapoints is used without considering the ef- 
fects of cosmic variance, may bias the best fit parameters r 
and y. We verified that, when attributing to each w(r p ) data- 
point the cosmic variance error as a function of r p resulting 
from our simulations, the best fit parameters ro and y are es- 
sentially unchanged. In the GOODS-N field ro and y change 
only by ~ 2%. In the GOODS-S field the change is smaller 
than 1%. This is due to the fact that the datapoints guiding the 
fits in both procedures are those with r p in the range 0.5 -4 A -1 
Mpc, which have both smaller Poisson errors and cosmic vari- 
ance. In the following we will therefore keep using the fitting 
procedure described in Section 3. 

5. Results 

Having defined the analysis methods to estimate the galaxy 
projected correlation function and the global errors related to 
it, we are now in the position to measure the clustering proper- 
ties of star forming galaxies in GOODS-S and GOODS-N and 
to compare them with those expected for mock galaxies from 
the Millennium simulation. Also, the clustering properties of 



Fig. 9. Best fit correlation length (upper panel) and slope (lower 
panel) measured over 40 mock fields obtained by splitting the 
2 deg 2 Millennium field into independent rectangles with the 
dimensions of a GOODS field. The average ro and y values 
(solid lines) and dispersion (shaded areas) are also shown. 



different source subsamples, defined e.g., on the basis of their 
IR luminosity, can be readily investigated. 

5.1. Correlation function of the full GOODS-S and 
GOODS-N samples 

We first measured the projected correlation function for the to- 
tal GOODS-N and GOODS-S samples over the projected scale 
range r p = 0.06 - 10 hr l Mpc. The results are shown in Fig. [8] 
In both fields a clear clustering signal is measured, with very 
high significance (> 35cr). The best fit parameters (ro, y) are 
4.25 h~ x Mpc, 1.51 in GOODS-S and 3.81 ft" 1 Mpc, 1.52 in 
GOODS-N (see Table 1). The clustering amplitude therefore 
appears about 10% larger in GOODS-S than in GOODS-N, 
confirming that the GOODS-S field has more structure than the 
GOODS-N field, as already noted from X-ray selected sources 
(Gilli et al. 2005). As shown in Fig. [8] most of the excess sig- 
nal in GOODS-S is produced at projected scales in the range 
0.8 < r p < 3h~ l Mpc, while at smaller and larger scales the sig- 
nals measured in the two fields are almost identical. A simple 
check was performed by computing the projected correlation 
function in the GOODS-S field after removing those sources 
within the two redshift spikes at z = 0.67 and z = 0.73, which 
showed that most of the excess signal at 0.8 < r p < 3 hr x Mpc 
is indeed produced by these two structures. At any rate, as it 
will be shown later, the difference among the two ro values is 
fully accounted for by cosmic variance. 

It should be noted that in the r p = 0.06 - 10 h~ l Mpc scale 
range considered here, the datapoints at the smallest and largest 
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scales are the least reliable. At small scales, e.g., r p < 0.1 h 
Mpc, source pairs at high redshifts (z > 1.2) have separa- 
tions on the plane of the sky comparable to the MIPS angular 
resolution at 24-fim. Therefore source blending may be an is- 
sue. Furthermore, other biases might be introduced by the dif- 
ferent angular selection functions of the many spectroscopic 
campaigns from which our catalogs are built. Also, the trans- 
verse size of the GOODS fields (19 arcmin diagonal) becomes 
smaller than r p ~ 8 h~ l Mpc for pairs at z < 0.5. The corre- 
sponding w(r p ) measurements may thus be distorted with re- 
spect to those at smaller scales because of the different redshift 
range sampled. At any rate, the datapoints at the smallest and 
largest scales have the largest errorbars and thus do not signif- 
icantly affect the overall estimate of the best fit parameters ro 
and y. Indeed, when repeating the fits limiting the r p range to 
0.1 - 8 A -1 Mpc (or even 0.4 - 8 h~ l Mpc), we obtained results 
in agreement with the previous ones within the errors. In the 
following computations we simply considered datapoints from 
r p - 10 hT x Mpc all the way down to the smallest scale from 
which we get signal. 

At scales r p < 0.3 h Mpc, the correlation function data 
points appear to lay above the best fit power law, which may 
indicate that the intra-halo clustering term, i.e., the clustering 
term due to galaxy pairs within the same dark matter halo, is 
emerging, as has recently been seen in very large galaxy sam- 
ples (e.g., SDSS, Zehavi et al. 2004). However, because of the 
possible biases in the w(r p ) datapoints at smaller r p scales men- 
tioned above, the observed small-scale excess should be con- 
sidered with caution. We will return to this in the Discussion. 

The clustering behavior measured for the GOODS sam- 
ples appears markedly different from the expectations from the 
Millennium simulation. As explained in the previous Section, 
we computed the projected correlation function for a sample 
of about 50000 objects in a mock galaxy catalog based on 
the Millennium run after applying the same selection criteria 
used for the real data. The projected correlation function for 
the mock catalog is also shown in Fig. [8] and the best fit clus- 
tering parameters are quoted in Table 1 . Simulated mid-IR se- 
lected sources appear much less clustered than real sources. 
The overall w(r p ) shape is also very different, with a flattening 
below 0.8 h~ l Mpc, as opposed to the steepening observed in 
GOODS, and a steepening above r p ~ 3 - 4 Mpc, whereas 
the GOODS w(r p ) appears to have a constant sloped 

A similar discrepancy between the predictions based on the 
Millennium mock catalogs and the real data has also been re- 
ported by McCracken et al. (2007), who measured the angular 
correlation function (ACF) of /-band selected galaxies in the 
COSMOS field. While at bright magnitudes the COSMOS and 
the Millennium ACF are in good agreement, at fainter magni- 
tudes, I > 22 mag, Millennium sources are less clustered than 
the real COSMOS sources, with an overall correlation func- 
tion shape very similar to the one we measured for Millennium. 
In the same work, McCracken et al. (2007) point out that the 



1 The subtle differences in the cosmological parameters adopted in 
this work with respect to those in the Millennium simulation (f2 m = 
0.25, S1 A = 0.75, h = 0.73) are unlikely to have any significant impact 
on our results. 



Table 2. Combined GOODS-S plus GOODS-N sample. The 
uncertainties take into account cosmic variance and have been 
computed as described in Section 4. 



Sample 


z range 


r [h' 


1 Mpc] 




r 


/ 24 > 20 /Jy 


0.1-1.4 


4.03 


+ 0.38 


1.51 


±0.08 


L, R > 10'°L G 


0.1-1.4 


4.31 


±0.47 


1.52 


±0.08 


L IR > 10"L o 


0.1-1.4 


5.14 


±0.76 


1.58 


±0.10 


Ur < 10"i Q 


0.1-1.4 


3.81 


±0.36 


1.54 


±0.08 



observed discrepancy cannot be accounted for by cosmic vari- 
ance. 

We checked to see if the discrepancy we find can be as- 
cribed to cosmic variance by dividing the 2 deg 2 simulated 
mock field into 40 non-overlapping rectangles with the same 
size as that of the GOODS fields (i.e., 10 x 16 arcmin) and 
measuring average and dispersion of the ro and y distributions 
over these regions. As shown in Fig|9] we found ro = 2.58, 
o> = 0.25 for the average correlation length and its disper- 
sion, and y = 1.50, cr y = 0.10 for the average slope and its 
dispersion. Repeating this exercise on two other independent 
2 deg 2 mock catalogs yielded similar results. 

The correlation lengths measured in the GOODS-S and 
GOODS-N fields then appear to be about 6 and 5 standard 
deviations, respectively, larger than the value measured from 
the Millennium catalog. It therefore seems unlikely that the 
stronger clustering measured in the GOODS fields be produced 
by cosmic variance. Several possible explanations for this dis- 
crepancy are investigated in the Discussion, as well as a series 
of caveats that have to be kept in mind when comparing models 
with observations. 

It is interesting to note how the average correlation length 
and slope measured on these 10 x 16 arcmin mock subsam- 
ples are smaller than those measured for the full 2 deg 2 mock 
catalog and reported in Table 1. One reason is that at large pro- 
jected separations, where the Millennium w(r p ) is steeper, the 
relative weight of the w{r p ) datapoints is much higher in the 
full 2 deg 2 field than in any GOODS-sized field, since distant 
galaxy pairs are much better sampled. As an example, over the 
whole r p - 0.06 - 10 h~ l Mpc range considered in this work, 
the number of pairs in a typical GOODS-sized field is maxi- 
mum in the range r p — 3 - 6 h~ x Mpc, while in the full 2 deg 2 
field it steadily increases towards larger projected separations. 
Another reason may be related to the effects of the integral con- 
straint (Groth & Peebles 1977), which bias the measurements 
of the correlation function on finite size fields. We estimate that 
the bias introduced by the integral constraint may affect the 
w(r p ) estimates by at most a few percent at the largest scales 
probed here (above 5 hr x Mpc). 

5.2. Dependence of clustering on IR-luminosity/star 
formation rate 

Recent observations have shown that, among star-forming 
galaxies at any redshift, the star formation rate appears to be 
correlated with the galaxy mass (Noeske et al. 2007; Elbaz 
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Fig. 10. Left: projected correlation function for sources with Lj R > 10 10 L o (SFR > 1.7 M Q yr~') as measured in GOODS-S, 
GOODS-N and Millennium simulation (open circles, filled circles and filled triangles, respectively). Errorbars for the GOODS 
samples take into account cosmic variance (see Section 4). The best fit power laws are shown as dashed lines. Right: as in the 
left panel but for LIRGs, i.e., objects with L, R > 10 11 L Q (SFR > 17 M yr" 1 ). 



et al. 2007; Daddi et al. 2007a). This is in agreement with 
the predictions from semi-analytic models of structure forma- 
tion (Finlator et al. 2006; Kitzbichler & White 2007), though 
models also predict that this correlation breaks down for the 
most massive galaxies. It is therefore interesting to investigate 
if and how the clustering of galaxies depends on the IR lumi- 
nosity, which is a good proxy for the star formation rate. We 
measured the projected correlation function for sources with 
L IR > 10 10 L Q and for LIRGs (L IR > 10 n L Q ), as shown in 
Fig. [TO] In both fields we measure an increase of the cluster- 
ing level with IR luminosity, with ro going from ~ 4 h Mpc 
for the whole samples to ~ 5 h _i Mpc for the LIRGs (see also 
Table 1 and 2). A comparison between the correlation length 
of the different samples is shown in Fig[13]for the combined 
GOODS-S plus GOODS-N fields. Because of the unavoidable 
degeneracy between luminosity and redshift which character- 
izes any flux limited sample, LIRGs are on average at higher 
redshifts than the full IR galaxy population. However, as re- 
ported in Table 1, while the median luminosity of LIRGs is 
about a factor of 5 larger than that of the total sample, their 
median redshift of z ~ 1 .0 is not dramatically higher than that 
of the total sample, z = 0.75. The modest difference in the me- 
dian redshift for the two samples suggests that luminosity, not 
cosmic time, is the main factor contributing to the clustering 
dependence that we observe. Because the dark matter cluster- 
ing is smaller at higher redshift, the difference would be even 
larger for the implied galaxy bias. Since ro for a given galaxy 
population is expected to increase with time, i.e., towards lower 
redshifts (see Section |6~4"l ), properly accounting for the redshift 



differences between subsamples would actually strengthen the 
detection of IR luminosity segregation of clustering. 

In order to properly establish the statistical significance 
of the trend of clustering versus luminosity, we also consid- 
ered sources with Lj R < 10 11 L Q (non-LIRGs), which there- 
fore constitute a source sample disjoint from the LIRGs (see 
Table 1). The difference between the clustering correlation 
length of LIRGs and non-LIRGs is about 3<x and 5<x significant 
in GOODS-S and GOODS-N, respectively. As explained in 
Section 4, only the Poissonian errorbars quoted in Table 1 have 
been considered for this estimate. However, since the redshift 
distributions of the LIRGs and non-LIRGs samples are rather 
different (e.g., the median redshift for LIRGs is z ~ 1.0, while 
for non-LIRGs it is z ~ 0.6 - 0.7; see Table 1), this evidence 
must be investigated further since the two populations might 
not be tracing the same large scale structures. We have there- 
fore restricted our analysis to the redshift range z = 0.5 - 1.0, 
which allows us to compare LIRGs and non-LIRGs at similar 
median redshifts (see Table 1). Fig. fTTI and fT2l show the red- 
shift distributions and the projected correlation functions w(r p ) 
measured for the z = 0.5 -1.0 LIRGs and non-LIRGs in the 
GOODS-S and GOODS-N field, respectively. Because of the 
limited source statistics, we used larger r p bins (Alog r p =0.2) 
than those previously adopted, and limit our analysis to the 
r p = 0.4 - 8 h~ x Mpc range, where the w(r p ) measure is more 
robust. We found that the significance of stronger clustering 
of LIRGs decreases slightly, to ~ 2 - 4<x, when performing 
this more appropriate comparison at similar median redshifts. 
Although the measured correlation lengths are quite sensitive to 
the choice of the redshift bin boundaries because of the spiky 
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nature of the observed redshift distributions, we note that we 
systematically measure larger correlation lengths for LIRGs 
than for non-LIRGs, even adopting other redshift intervals. We 
conclude that our data suggest an increase of the correlation 
length with average L/« or SFR, although this result needs to 
be confirmed using larger samples with better statistics. 

As in the case of the total sample, we compared the results 
from the GOODS fields with those from the Millennium simu- 
lation. In Fig.[T3lthe ro values of the samples with L/r > 10 10 L Q 
and Lir > 10 u L o in the redshift range z=0. 1-1.4 for the com- 
bined GOODS-S plus GOODS-N sample (see Table 2) are 
plotted as a function of the sample median luminosity and 
compared with the expectations from mock samples extracted 
from Millennium using the same Ljr thresholds. Since in each 
Millennium sample the median Lm is lower than in the corre- 
sponding GOODS sample (see Table 1) -and this is especially 
true for LIRGs- we also measured w(r p ) for mock sources 
above 2 x 10 11 L , which have the same median luminosity 
of GOODS LIRGs. Again, we used the 40 GOODS-sized sub- 
regions of the 2 deg 2 full mock field to obtain the average cor- 
relation length and dispersion for model galaxies selected at 
different luminosities. This is shown by the shaded region in 
Fig. [13] Even at high luminosities, the overall clustering of the 
data appears stronger than that predicted by the simulations, 
although with reduced significance. 

As noted above, among galaxies with /24 > 20/zJy, the frac- 
tion of IR luminous objects is lower in the mock catalog than 
in GOODS. As an example, the fraction of LIRGs is 13% in 
Millenium, as opposed to the 30% in GOODS (see Table 1). 
This is related to the fact that, as emphasized by Elbaz et al. 
(2007), Millennium galaxies are forming stars at rates * 3 
times lower than those which are observed at z ~ 1 . We have 
verified that artificially increasing the SFR of all model galax- 
ies (i.e., independent of their positions within the simulation) 
by this amount does not change our conclusions, as it would 
imply even smaller correlation lengths all luminosities (as can 
already be argued from Fig.fTSl. 

The AGN removal performed on our sample does not 
significantly affect the best fit correlation lengths or slopes. 
However, two points are worth noting. First, the fraction of 
AGN candidates is higher among LIRGs (17%) than in the total 
samples (8%), consistent with what observed for IRAS galaxies 
in the local Universe, where a higher fraction of AGN is found 
in more luminous IR objects (e.g., Sanders & Mirabel 1996). 
Second, a small (~ 5 — 7%) systematic decrease of the cor- 
relation lengths is observed when AGN are removed from the 
samples, which is consistent with the fact that AGN in GOODS 
(which have ro = 5 - 10 /T 1 Mpc, Gilli et al. 2005) are more 
strongly clustered than is the full IR galaxy population. 

5.3. Implications for the cosmic variance of 24fim 
source counts 

The measured clustering level of star forming galaxies implies 
that important field-to-field variations should be observed in the 
number counts of these sources. As discussed in Section 2, we 
have in fact found that the surface densities in GOODS-N ver- 
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Fig. 11. Upper panel: redshift distributions and selection func- 
tions for LIRGs and non-LIRGs in GOODS-S. Sources in the 
z = 0.5 -1.0 redshift interval used to compute the projected 
correlation function w(r p ) shown in the lower panel have been 
shaded. Lower panel: projected correlation function w(r p ) mea- 
sured in GOODS-S for LIRGs and non-LIRGs in the redshift 
interval z - 0.5 - 1.0. Poisson errorbars are used here since 
the comparison is performed between samples with similar red- 
shift distributions in the same field. The best fit power laws are 
shown as dashed lines. 



sus GOODS-S field differ at the 20% level, once spectroscopic 
incompleteness is taken into account. Given our direct cluster- 
ing measurements, we can verify a posteriori if this difference 
may be understood in terms of cosmic variance in the counts. 
The expected total variance in the counts can be expressed as: 



a 2 =N(l +NxIC) 



(7) 



where N is the average number of galaxies observed and IC 
is the integral constraint (see, e.g., Daddi et al. 2000 for defi- 
nitions), which depends on the angular clustering amplitude A 
and can be related to it following Roche et al. (1999). We have 
used the best fit clustering parameters ro and y, Limber's equa- 
tion, and the observed redshift distribution functions (Fig. [5J to 
compute that sources with fan > 2Qp}y should have an angu- 
lar clustering amplitude of A(l°) ~ 0.008. Given the values of 
the angular correlation amplitude and slope, and the size of the 
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Fig. 13. From top to bottom panel: best fit correlation length, 
slope and amplitude, for the total, L/r > 10 10 L Q and Lm > 
10 11 L samples obtained by combining the GOODS-S and 
GOODS-N fields (see Table 2). The best fit clustering param- 
eters are plotted at the sample median Lm- The shaded areas 
show the average and dispersion of the best fit clustering pa- 
rameters measured over 40 mock fields with the dimensions of 
a GOODS field (see text for details). 



GOODS fields we infer an integral constraint of 0.13. By in- 
serting these values in Eq. 7 one sees that NxIC » 1, i.e., that 
fluctuations in the number counts of galaxies with > 20yuJy 
in GOODS-sized fields are dominated by clustering (i.e., cos- 
mic variance) rather than counting (Poisson) statistical uncer- 
tainties. We expect fluctuations at the level of 35% (lcr) in 
the counts for > 20yi/Jy galaxies in GOODS-sized fields, 
fully explaining the observed difference between GOODS-S 
and GOODS-N. 

6. Discussion 

6.1. Comparison with other galaxy samples atz ~ 1 

Deep redshift surveys such as V VDS and DEEP2 are providing 
an accurate census of the galaxy population at z ~ 1, measur- 
ing in particular the dependence of galaxy clustering on sev- 
eral parameters such as the galaxy spectroscopic type, color 
and luminosity. In both surveys, galaxies which can be identi- 
fied as star forming appear to have a correlation length smaller 
than that measured for our GOODS 24/im selected sample, al- 
though the significance of this difference is still limited. In de- 
tail, Coil et al. (2004) find r = 3.2 ± 0.5 \iT x Mpc for emis- 
sion line galaxies in DEEP2 (~ 1.3cr lower than that for the 
total GOODS 24/j.m sample), while Meneux et al. (2006) find 
ro = 2.5 + 0.4 hT x Mpc for star forming, blue galaxies in the 



VVDS (~ 2.7cr lower than the total GOODS 24/mi sample). 
The main difference between the GOODS sample considered 
here and those from DEEP2 and VVDS resides in the selection 
at mid-IR versus optical wavelengths. The required detection 
of sources at 24/vm for GOODS (in particular the requirement 
of fiA > 20 fiJy) imposes a lower limit to SFR of about 2.5 M 
yr at z ~ 0.8 (see Fig. @), while optical selection (Iab < 24 
mag and Rab < 24.1 mag for VVDS and DEEP2 galaxies, re- 
spectively) does not translate as directly into a SFR. Indeed, 
because of older stars and dust extinction, even galaxies with 
very similar optical properties could span a very wide range 
of star formation rates. We verified that if we impose a cut in 
SFR or 24/im flux density on the Millennium mock catalogs, 
many low-SFR objects excluded from the sample would be in- 
cluded if a simple optical magnitude cut had been used instead 
(e.g., zab < 23.5 mag, the limit for optical spectroscopy of 
GOODS sources considered here). In fact, the median SFR of 
Millennium mock sources increases by a factor of ~ 6 when 
the additional mid-IR cut is included. Therefore, in optically 
selected samples, star forming galaxies are expected to have 
a lower star formation rate on average than that of our MIPS 
sources. The trend discussed in the previous Section, in which 
ro is larger for samples selected at increasing Ljr (or SFR), is in 
line with this interpretation. In connection with the above con- 
siderations, it is interesting to note that the strong clustering 
level measured for GOODS LIRGs appears then to be more 
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similar to that measured for passive galaxies than for moder- 
ately star forming galaxies at z ~ 1 (Coil et al. 2004, see also 
Fig. fT4l) . Since the amplitude of galaxy clustering is directly 
related to the galaxy mass (on average, more massive galaxies 
reside in denser, i.e., more clustered, environments), this re- 
sult is in agreement with the observed dichotomy for massive 
galaxies at z < 1 .2, most of which either have already ceased 
forming stars, or are doing so at very high rates (Noeske et al. 
2007; Elbaz et al. 2007). 

6.2. Comparison with predictions of galaxy formation 
models 

In Section 5 we showed that MIPS detected sources in 
the GOODS fields appear to be significantly more clustered 
than expected from galaxy formation models based on the 
Millennium simulation (Kitzbichler & White 2007). One may 
wonder if this discrepancy can be ascribed to uncertainties in 
the SFR to Ljr conversion, since Ljr is the available (although 
indirect) measurement for real data, while SFR is the primary 
output for mock galaxies. Under different assumptions on the 
stellar IMF the overall uncertainties in the SFR to Ljr relation 
can be quantified to about 30%. We verified that a 30% vari- 
ation of the 24jum flux density threshold in the mock catalog 
does not alter significantly the Millennium correlation function. 

As emphasized by Elbaz et al. (2007), at z ~ 1 Millennium 
galaxies are forming stars at rates about a factor of 3 lower 
than observed galaxies. As far as object selection is concerned, 
artificially increasing the SFR of model galaxies is equivalent 
to selecting galaxies in the mock sample at lower 24/mi flux 
densities. This selects many more sources, which are in gen- 
eral less clustered since the lower tail of the SFR distribution 
is now being sampled. We checked that reducing the limiting 
/24 flux density by a factor of 3 produces a lower correlation 
function for Millennium sources, thus reinforcing the discrep- 
ancy with the real data. To be fair, it should be noted that sim- 
ulated galaxies are free from some of the observational selec- 
tion effects which affect real data in our samples and compli- 
cate a direct comparison. For example, at the faintest flux lim- 
its of / 2 4 ~ 20yuJy, where S/N~ 5 for MIPS detections, we 
might be failing to detect sources in crowded regions or close 
to brighter mid-IR targets. We expect this should be a small 
effect, but not entirely negligible and in any case difficult to 
properly simulate. Also, the 50-65% spectroscopic complete- 
ness may introduce a bias if sources with measured redshifts 
have different clustering properties from sources without red- 
shifts (i.e., if sources with redshifts are not a random sam- 
pling of the full population). For example, some tendency is 
detected in both fields for larger spectroscopic completeness 
at brighter z-band magnitudes (see Fig. 1). Therefore the ob- 
served discrepancy between the GOODS data and the mock 
catalogs from Millennium should be considered by keeping in 
mind those caveats. 

At any rate it is interesting to investigate what could be 
a likely ingredient that has to be modified within the semi- 
analytic models in Millennium to explain the observed discrep- 
ancy. We suggest here that a possible weakness in the models 
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Fig. 14. Correlation length and space density of GOODS 'all' 
{fin > 20//Jy) and LIRGs galaxy samples considered in this 
work are compared to that of other galaxy populations at z ~ 1, 
as labeled. The trend predicted from the Millennium simulation 
for dark matter halos at z ~ 1 above different mass thresh- 
olds is also shown as a shaded region. More massive halos 
(log of the threshold mass is labeled) are less abundant and 
more clustered than less massive ones. GOODS IR galaxies 
and the absorption line galaxies of Coil et al. (2004) appear 
more abundant than the halos that can host them (i.e., having 
the same ro value), suggesting the presence of more than one 
galaxy per halo. As discussed in the text, the corresponding 
IR galaxies and LIRGs at z ~ 1 in the mock galaxy catalogs 
based on Millennium appear significantly less clustered than 
observed in GOODS. Moreover, Millennium LIRGs are also 
significantly less abundant than GOODS LIRGs. Values plotted 
for Millennium LIRGs and IR galaxies were derived averaging 
measurements in 40 GOODS-sized mock fields. 

is the SFR algorithm adopted for the mock galaxies. Indeed, 
within simulated dense environments like galaxy clusters and 
groups, a very abrupt cut-off of gas-cooling is applied to galax- 
ies as soon as they become non-central. Therefore, simulated 
satellite galaxies might be not forming stars at sufficiently high 
rates, which would indeed reduce the correlation length of the 
star forming simulated population as well as their number den- 
sity (see the next Section). 

6.3. The connection with dark matter halos 

While at small scales, comparable to the dimensions of dark 
matter halos, the clustering of a given galaxy population is dif- 
ficult to predict because of merging and interactions that can 
trigger a number of physical processes, at larger scales (e.g., 
> 1 h Mpc), where galaxy interactions are rare, the galaxy 
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correlation function should follow that of the hosting dark mat- 
ter halos. An interesting consequence is that one can estimate 
the masses of the typical halos hosting a given galaxy pop- 
ulation by simply comparing their clustering level (see, e.g., 
Giavalisco & Dickinson 2001). Indeed, according to the stan- 
dard ACDM hierarchical scenario, dark matter halos of differ- 
ent mass cluster differently, with the more massive halos be- 
ing more clustered for any given epoch, and it is then straight- 
forward to compute the correlation function for halos above a 
given mass threshold. It is worth noting that since less mas- 
sive halos are more abundant, the correlation function of halos 
above a given mass threshold is very similar to the clustering of 
halos with mass close to that threshold. Also, it is important to 
note that as far as our clustering measurements are concerned 
(see Section 5), the w(r p ) datapoints at large scales (r p > 1 h~ l 
Mpc) have smaller errorbars and guide the power law fit (see 
Fig. 8). Therefore the measured ro and y values are essentially 
due to the clustering signal at large scales, where the galaxy 
correlation function follows that of the dark matter, allowing a 
meaningful comparison with the clustering expected for dark 
matter halos. 

We considered the dark matter halo catalogs available for 
the milli-Millennium simulation^, a reduced version of the 
Millennium run which includes 1/5 12 of the full simulated vol- 
ume. Halo catalogs are available at different time steps along 
the simulation. Here we considered those at z ~ 1 (param- 
eter stepnum=41 in the simulation). In total there are about 
32000 halos with mass above 10 10 M in a cubic volume of 
62.5 lr l Mpc on a side. We computed the correlation func- 
tion and the space density of halos above mass thresholds of 
log(M/M o )=10.8, 11.2, 11.6, 12.0, 12.4, 12.8. Here we use as 
halo mass estimator the simulation parameter m_Crit200, de- 
fined as the mass within the radius where the integrated halo 
overdensity is 200 times the critical density of the simulation. 
The results are shown in Fig. [14] where it is readily evident 
that more massive halos are more clustered and less numer- 
ous. The halo region plotted in Fig. [14] takes into account the 
fluctuations in the halo space density due to cosmic variance on 
volumes equal to the milli-Millennium volume (see Section [531 
and Somerville et al. 2004 for a description of the methods to 
derive the fluctuations in the source counts from the clustering 
parameters). 

We computed the space density of sources in our GOODS 
samples and compared the ro and density values of our pop- 
ulations with those of other galaxy populations at z ~ 1 and 
with those of dark matter halos at z ~ 1 as computed above. 
Comparable values for the space densities of GOODS sources 
were found when considering the full z = 0.1 - 1.4 redshift 
range or a restricted redshift interval (z = 0.7 - 1.2) around 
the peak of the selection function. The comparison is shown 
in Fig. [14] Conservative uncertainties of 50% have been con- 
sidered in the galaxy space densities, which should take into 
account the fluctuations due to cosmic variance as well as the 
uncertainties in the volume effectively spanned by the consid- 
ered galaxy populations. By comparing the halo and the galaxy 
r values, one can immediately see that /24 > 20 yt/Jy star form- 

2 see http://www.g-vo.org/Millennium 



ing galaxies are hosted by halos with masses > 8 x 10 11 M , 
while LIRGs, which are more clustered, are on average likely 
hosted by more massive halos with M > 3 x 10 12 M . The pop- 
ulation of absorption-line galaxies by Coil et al. (2004) also ap- 
pears to be hosted by massive halos (M > 5 x 10 12 M ), while 
their emission line galaxies seem to reside in smaller halos with 
M > 4 x 10 11 M Q . When looking at their space densities, 
/24 > 20 fiJy star forming galaxies (and LIRGs) and absorp- 
tion line galaxies at z ~ 1 appear more abundant than halos that 
can host them, i.e., there is likely more than one such galaxy 
per halo. This is consistent with our measurements of w(r p ). 
Indeed, as shown in Fig. 8, the clustering signal is well detected 
down to very small scales of r p - 60 h l kpc, well within the 
typical size of dark matter halos. As an example, the average 
half-mass radius for Millennium halos with M > 8 x 10 11 M , 
i.e., those which likely host GOODS IR galaxies, is about 100 
hT x kpc. Therefore, most of the signal at scales r p < 0.3 h' 1 
Mpc is likely dominated by galaxies within the same halos (i.e., 
the so-called intra-halo term) and a steepening of w(r p ) is in- 
deed consistently observed at these scales (Fig. 8). A fully con- 
sistent analysis of mid-IR galaxy clustering within the halo oc- 
cupation number (HOD) theoretical framework (e.g., Peacock 
& Smith 2000; Moustakas & Somerville 2002; Kravtsov et al. 
2004) is however beyond the scope of this paper. 

To conclude this Section we note that Millennium sim- 
ulated star forming galaxies and LIRGs at z ~ 1 are less 
clustered than observed in GOODS and that, moreover, ob- 
served LIRGs appear significantly more abundant than those in 
Millennium (Fig. 12). This further supports the interpretation 
that, at z ~ 1, many galaxies within dense environments such 
as groups or clusters are forming stars at high rates, in contrast 
to the star formation history assumed in the Millennium simu- 
lation. The model's scarcity of star forming galaxies in dense 
environments, e.g., within the same dark matter halo, may be 
also responsible for the observed flattening of the Millennium 
correlation function towards small scales (see Fig. 8). 

It is not clear yet what is the main driver of star forma- 
tion in galaxies at z ~ 1 . On the one hand, a correlation be- 
tween star formation rate and galaxy mass is observed (Noeske 
et al. 2007; Elbaz et al. 2007). On the other hand, as found in 
this work, higher star formation rates are hosted by galaxies in 
denser environments. These two results are perfectly consistent 
one another (and with the conclusions of Elbaz et al. 2007 and 
Cooper et al. 2007), since more massive galaxies are indeed 
located in dense environments, but it is hard to establish what 
is the ultimate driver for the star formation increase: is it the 
galaxy mass or the environment? In other words, is the star for- 
mation rate in each galaxy simply linked to the gas mass and 
triggered at a given time along the galaxy life almost indepen- 
dently of the environment or, instead, are environmental effects 
necessary to produce gas instabilities and trigger star forma- 
tion? Solving these issues is beyond the scope of this paper. It 
will require much larger samples of star forming galaxies with 
spectroscopic redshifts, with which one will be able to study 
clustering of galaxies versus their star formation rates in nar- 
row mass bins. 
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Fig. 15. Bias (upper panel) and correlation length (lower panel) 
for the total, L IR > 1O 1O L and L, R > 10 n L Q combined 
GOODS samples quoted in Table 2, compared with evolution- 
ary tracks computed according to a conserving scenario (solid 
lines, see text for details). The shaded area shows the ro evolu- 
tion of z ~ 2 star forming galaxies as computed by Adelberger 
et al. (2005). 

6.4. Descendants and progenitors ofz~ 0.5-1 star 
forming galaxies 

Under simple assumptions, the spatial clustering of an extra- 
galactic source population measured at a given epoch can be 
used to estimate the typical dark matter halos in which these ob- 
jects reside, and then to estimate their past and future history by 
following the halo evolution in the cosmological density field. 
A useful quantity for such analyses is the bias factor, defined as 
b 2 (r,z,M) = ^ g (r,z,M)/^„(r,z), where £ g (r,z,M) and £ m (r,z) 
are the correlation function of the considered galaxy population 
and that of dark matter, respectively. In general the bias param- 
eter can be a function of scale r, redshift z, and object mass M. 
For simplicity we adopt the following definition here: 

b 2 (z) = { g (8,z)/U8,z) (8) 

in which £g(8, z) and £ m (8,z), are the galaxy and dark matter 
correlation function evaluated at 8 h Mpc, respectively. The 
galaxy correlation function has been measured directly in this 
work, while the dark matter correlation function can be esti- 
mated using the following relation (e.g., Peebles 1980): 

US,z) = oi(z)/J 2 (9) 

where J 2 = 72/ [(3 - y)(4 - y)(6 - y)2 y ] and o-\{z) is the dark 
matter mass variance in spheres of 8 hr x Mpc comoving ra- 
dius, which evolves as cr%(z) = os(Q)D(z). D(z) is the lin- 
ear growth factor of perturbations, while cr 8 = cr 8 (0) is the 



rms dark matter fluctuation at present time, which we fix to 
cr 8 = 0.8 in agreement with the recent results from WMAP3 
(Spergel et al. 2007). While in an Einstein - De Sitter cosmol- 
ogy the linear growth of perturbations is simply described by 
Dec/s (z) = (1 +z) , in a A-dominated cosmology the growth of 
perturbations is slower. We consider here the so-called growth 
suppression factor g(z) = D(z)/DEds(z) as approximated ana- 
lytically by Carroll, Press & Turner (1992). 

The above relations allow us to estimate the bias of the 
galaxy population at its median redshift. One can further as- 
sume that the spatial distribution of the observed galaxy popu- 
lation simply evolves with time under the gravitational pull of 
growing dark matter structures. This scenario, in which galaxy 
merging is considered negligible, is often called the galaxy con- 
serving model and in this case the bias evolution can be approx- 
imated by 

b(z) = 1 + [HO) - 1]/D(Z) (10) 

where b(Q) is the population bias at z = (Nusser & Davis 
1994, Fry 1996, Moscardini et al. 1998). 

Once b(z) is determined, the evolution of f g (8, z) and hence 
of ro(z) can be obtained by inverting Eq.[8] The best fit y ~ 1.5 
values found in this work are assumed in the above relations. 
In Fig. Q3] we show the evolution of b(z) and ro(z) for the 
combined GOODS samples reported in Table 2. Star form- 
ing (/24 > 20 //Jy) objects at z ~ 0.7 are expected to have 
ro ~ 6 - 7 lT x Mpc at a redshift of 0.1. Since local early type 
galaxies with L < L, have been observed to be clustered 
that strongly in the SDSS and 2dFGRS (Zehavi et al. 2002; 
Madgwick et al. 2003), at least part of them could descend from 
Z ~ 0.7 star forming objects. Similarly, some of the brighter 
(L ~ L») ellipticals in the local Universe, for which ro ~ 8 h 
Mpc has been measured (Guzzo et al. 1997, Budavari et al. 
2002) could descend fromz ~ 1 LIRGs (L, R > 10 11 L ), which 
are expected to evolve into a population with ro ~ 7 - 8 h 
Mpc by z — 0. This would be consistent with the recent find- 
ings by Cimatti, Daddi & Renzini (2006), who observe a lower 
number density of L < L* early type galaxies at z ~ 0.8 than 
at z = 0, suggesting that at least part of local ellipticals have 
formed since z ~ 1 . 

The slope of the correlation function for local ellipticals is 
generally found to be steeper than that observed for GOODS 
IR galaxies. Slopes of y ~ 1.9-2 have indeed been mea- 
sured for local ellipticals (Guzzo et al. 1997, Zehavi et al. 2002, 
Madgwick et al. 2003), as opposed to y ~ 1.5 - 1.6 for GOODS 
star forming galaxies measured in this work. While an aver- 
age steepening of the matter correlation function and of the 
overall galaxy population is expected towards lower redshifts 
(see, e.g., Kauffman et al. 1999, Moustakas & Somerville 2002) 
since the clustering level progressively increases at smaller 
scales, the clustering evolution in the proposed galaxy conserv- 
ing scenario above is computed by assuming a fixed (y — 1 .5) 
slope. Also it has to be kept in mind that the galaxy conserving 
scenario is an ideal, rather extreme, representation of galaxy 
evolution, since it, by definition, neglects galaxy merging. It is 

3 In the R band, the characteristic luminosity of z ~ early type 
galaxies L» is M* = -21.5 (Baldry et al. 2004). 
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therefore somewhat misleading to determine the descendants 
of a high redshift galaxy population simply based on the ro 
comparison without considering the slope. A z ~ 1 star form- 
ing galaxy does not evolve automatically into a z = elliptical 
and perhaps subsamples of the local spiral galaxy population 
may have the clustering properties expected for the descen- 
dants of z ~ 1 star forming galaxies. In an SDSS-based paper, 
Budavari et al. (2002) have analyzed the clustering properties 
of z ~ 0.2 galaxies with different spectral energy distributions 
(SEDs) corresponding to those of galaxies with different mor- 
phological types. They found that bright (-23 < Mr < -21) 
galaxies with SEDs corresponding to the morphological type 
Scd have a correlation length of ro = 6.75 h~ x Mpc, similar to 
those of ellipticals at the same redshift, but with a shallower 
slope y ~ 1 .7. We suggest that part of the GOODS LIRGs pop- 
ulation may then evolve into bright, massive spirals. By adding 
the space densities of local ellipticals and bright spirals one fur- 
ther sees that this is similar to what is measured for z ~ 1 star 
forming galaxies. 

Recently, Adelberger et al. (2005) measured the clustering 
of star forming galaxies at z ~ 1.5-2 (BM and BX samples) 
and at z — 3 (LBGs, see also Giavalisco & Dickinson 2001). 
By comparing the galaxy correlation function with that of dark 
matter halos in the ACDM-GIF simulation (Kauffmann et al. 
1999), Adelberger et al. (2005) found that UV selected galax- 
ies at z ~ 2 are hosted by halos with masses around 1O 12 M . 
Furthermore, by following the evolution of these halos in cata- 
logs computed at subsequent time steps in the simulation, they 
were then able to infer the correlation length of the descen- 
dants of the z ~ 2 galaxy population. At z < 1 they find that 
the only galaxy population with clustering strong enough to be 
consistent with that of the expected descendants of UV selected 
galaxies are red absorption line dominated galaxies from Coil 
et al. (2004). In Fig. [15] the expected evolution of z ~ 2 star- 
burst galaxies as computed by Adelberger et al. (2005) is also 
shown. The clustering length of LIRGs at z ~ lis large enough 
to be consistent with the one predicted for the descendants of 
UV selected galaxies. Moreover, the correlation slopes of the 
two populations are similar (y ~ 1.5 - 1.6). The average SFR 
of UV-selected galaxies is also of the same order of that of 
LIRGs (35 M yr on average.) It is therefore possible that 
LIRGs at z ~ 0.5-1, in addition to passive galaxies, may be the 
direct descendants of UV-selected galaxies. This would imply, 
in turn, that star formation in these galaxies is sustained, either 
continuously or intermittently, over cosmological timescales of 
a few Gyrs and suggests they assemble stellar masses up to 
~ 10 M Q from z ~ 3 to z ~ 1. Our conclusions on the z ~ 1 
descendants of high redshift star forming galaxies add to those 
reached by Adelberger et al. (2005), who, based on the com- 
parison with the correlation lengths measured in the DEEP2 
surveys, identify passive absorption line galaxies at z ~ 1 as 
the descendants of their LBG population. DEEP2 star forming 
objects were on the contrary ruled out based on their small cor- 
relation length. As explained in the previous Section, the low 
correlation length of emission line (star forming) galaxies in 
the DEEP2 survey can be ascribed to a SFR on average lower 
than that measured for our LIRGs. Our results suggest that star 
formation is intense in a significant fraction of massive objects 



at z ~ 1 and that the descendants of high redshift star-forming 
galaxies have not necessarily stopped forming stars at z ~ 0.5- 
1 . If we consider that LIRGs and passive galaxies at z ~ 1 have 
similar space densities (~ 2.5 - 3 x 10~ 3 Mpc -3 , Fig.fPfl). and 
that their combined density is of the order of the LBG space 
density (~ 4 - 6 x 10~ 3 Mpc~ 3 ), then we can conclude that a 
significant fraction of z ~ 2 star forming galaxies might still be 
forming stars at z ~ 1 . 

7. Summary and conclusions 

We present the first measurements of the spatial clustering 
of star forming galaxies at z ~ 1 selected at 24//m by 
Spitzer/MlPS in the GOODS-S and GOODS-N fields. The cor- 
relation length for the total combined sample has been found 
to be 4.0 + 0.4 ft -1 Mpc, the r value in GOODS-S being 
~ 10% larger than in GOODS-N. We estimate the uncertain- 
ties in our measurements using mock catalogs extracted from 
the Millennium simulation, which show that the GOODS-S 
and GOODS-N measurements are fully consistent with the ex- 
pected cosmic variance on these 160 arcmin 2 fields. We find 
indications for an increase of the correlation length with Ljr 
(or SFR), with LIRGs having r ~ 5.1 ± 0.8 hT x Mpc. The 
measured correlation length in the GOODS mid-IR selected 
samples appears larger than that measured in optical samples 
of star forming galaxies at z ~ 1 such as those in the DEEP2 
or the VVDS surveys. Although the significance of this result 
is still limited (1 - 3cr), it might be interpreted as evidence that 
the average star formation rate in optically selected samples of 
emission line galaxies is lower than that of our samples, which, 
by selection, have larger IR luminosity. This is in agreement 
with the observed relation between IR luminosity and cluster- 
ing strength, which, in turn, suggests that at z ~ 1 more intense 
star formation is hosted by more massive (i.e., more clustered) 
systems. 

The measured correlation length is significantly larger than 
that expected from the Millennium simulations, once the se- 
lection criteria adopted to define the real data samples are ap- 
plied to the mock samples. This suggests that star formation is, 
on average, occurring in dark matter halos that are more mas- 
sive than those predicted by the galaxy formation model imple- 
mented in the Millennium simulation by Croton et al. (2006). 
By comparing the clustering of GOODS star forming galaxies 
with that of Millennium dark matter halos, we find that more 
luminous galaxies are hosted by progressively more massive 
halos, with LIRGs residing in halos with M > 3 x 10 12 M o . 
Since the measured LIRG space density is higher than that of 
the hosting halos, each halo appears to contain on average more 
than one LIRG. This is also supported by the steepening of the 
correlation function observed towards smaller scales, which is 
usually interpreted as due to galaxy pairs within the same dark 
matter halo (intra halo clustering). 

Based on a galaxy conserving scenario, in which it is as- 
sumed that galaxies observed at a given redshift evolve without 
merging, simply pulled by the surrounding density field, we 
trace the time evolution of the bias parameter and of the cor- 
relation length of z ~ 1 star forming galaxies. By comparing 
the evolved correlation lengths with those of local and high- 
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redshift galaxy samples, we infer the likely descendant and pro- 
genitors of our z ~ 1 sample. We find that objects in our sample 
may evolve into L < L* ellipticals or bright spirals by z = 0, 
with LIRGs evolving into bright L ~ L* objects. Similarly, 
LIRGs, together with passive absorption line galaxies at z ~ 1, 
may be identified as the descendants of UV-selected star form- 
ing galaxies at z ~ 2. 
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